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Abstract 



This paper is a continuation of the study of topological properties of omega context 
free languages (w-CFL). We proved in [Topological Properties of Omega Context 
O \ Free Languages, Theoretical Computer Science, Volume 262 (1-2), 2001, p. 669- 

697] that the class of w-CFL exhausts the finite ranks of the Borel hierarchy, and 
in [Borel Hierarchy and Omega Context Free Languages, Theoretical Computer 
^' , Science, to appear] that there exist some cj-CFL which are analytic but non Borel 

^f^ \ sets. We prove here that there exist some omega context free languages which are 

\^ • Borel sets of infinite (but not finite) rank, giving additional answer to questions of 

Lescow and Thomas [Logical specifications of infinite computations. In:" A Decade 



J£^ ; of Concurrency" (J. W. de Bakker et al., eds), Springer LNCS 803 (1994), 583-621]. 
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1 Introduction 



Since J.R. Biichi studied tlie w-languages recognized by finite automata to 
prove the decidability of the monadic second order theory of one successor over 
the integers [Biic60a] the so called cu-regular languages have been intensively 
studied. See [Tho90] and [PP02] for many results and references. 

Pushdown automata are a natural extension of finite automata. R. S. Cohen 
and A. Y. Gold [CG77] , [CG78] and M. Linna [Lin76] studied the w-languages 
accepted by omega pushdown automata, considering various acceptance con- 
ditions for omega words. It turned out that the omega languages accepted by 
omega pushdown automata were also those generated by context free gram- 
mars where infinite derivations are considered, also studied by M. Nivat [Niv77] 
[Niv78] and L. Boasson and M. Nivat [BN80]. These languages were then called 
the omega context free languages (w-CFL). See also Staiger's paper [Sta97a] 
for a survey of general theory of w-languages, including more powerful accept- 
ing devices, like Turing machines, and the fundamental study of J. Engelfriet 
and H. J. Hoogeboom on X-automata, i.e. finite automata equipped with a 
storage type X, reading infinite words [EH93]. 

Topology is a useful tool for classifying w-languages by the study of their 
complexity, particularly with regard to the Borel hierarchy. 

McNaughton's Theorem implies that w-regular languages ( w-languages ac- 
cepted by deterministic Muller automata) are boolean combination of Il^-sets, 
[MaN66]. Topological properties of cj-regular languages were first studied by 
L. H. Landweber in [Lan69] where he characterized cu-regular languages in a 
given Borel class. 

J. Engelfriet and H. J. Hoogeboom proved that all cu-languages accepted 
by deterministic X-automata with a Muller acceptance condition are also 
boolean combinations of Ila-sets hence (S3 fl n3)-sets. 

When considering non deterministic finite machines, as X-automata, a natu- 
ral question, posed by H. Lescow and W. Thomas in [LT94], now arises: what 
is the topological complexity of w-languages accepted by automata equipped 
with a given storage type X? Are they all Borel sets of finite rank, Borel sets, 
analytic sets? 

It is well known that every w-language accepted by a Turing machine (hence 
also by a X-automaton) with a Muller acceptance condition is an analytic 



set [Sta97a] (i.e. is obtained as a continuous image of a Borel set or as the 
projection of a Borel set [Mos80]). 

We consider in this paper the storage type "pushdown" . We pursue the inves- 
tigation of topological properties of omega context free languages. We proved 
that the class of w-CFL exhausts the finite ranks of the Borel hierarchy, giving 
examples of n°-complete (respectively S^-complete) u-CFL for each integer 
n > 1, [FinOla]. We showed in [FinOOa] that there exist some omega context 
free languages which are analytic but non Borel sets. There exist such u- 
languages in the form L'^, with L a context free finitary language; this gave 
an answer to questions of D. Niwinski and P. Simonnet about omega powers 
of finitary languages [Niw90] [Sim92] . 

But the question was still open whether there exist some omega context free 
languages which are Borel sets of infinite rank. 

We answer to this question in this paper giving examples of w-CFL which are 
Borel sets of infinite rank. 

The paper is organized as follows. In sections 2 and 3, we first review some 

above definitions and results about w-regular, cj-context free languages, and 

topology. 

In section 4 we introduce the operation of exponentiation of sets defined by 

J. Duparc in his recent study of the Wadge hierarchy of Borel sets, which is a 

great refinement of the Borel hierarchy [DupOl], and recall preceding results 

of [FinOla]. 

In section 5, we prove our main result about u-CFL, using an iteration of 

Duparc's operation and give additional answer to questions of W. Thomas 

and H. Lescow [LT94]. 



2 cj-regular and cj-context free languages 



We assume the reader to be familiar with the theory of formal languages and of 

cj-regular languages, see for example [HU69] ,[Tho90]. We first recall some of 

the definitions and results concerning cj-regular and w-context free languages 

and omega pushdown automata as presented in [Tho90] [CG77] [CG78]. 

When E is a finite alphabet, a finite string (word) over S is any sequence 

X = Xi . . .Xk , where Xj G S for i = 1, . . . , /c ,and k is an integer > 1. The 

length of a; is k, denoted by |a;| . 

If |a;| = , a; is the empty word denoted by A. 

we write x{i) = Xi and x[i] = x(l) . . . x{i) ior i < k and x[0] = A. 

E* is the set of finite words over S. 

The first infinite ordinal is u. 



An w-word over E is an w -sequence ai . . . a„ . . ., where Oj G E, Vi > 1. 
When a is an cu-word over S, we write a = a{l)a{2) . . . (j{n) . . . 
and a[n] = o"(l)o"(2) . . . a{n) the finite word of length n, prefix of a. 
The set of w-words over the alphabet S is denoted by S*^. 
An w-language over an alphabet S is a subset of S'^. 

The usual concatenation product of two finite words u and v is denoted u.v 
(and sometimes just uv). This product is extended to the product of a finite 
word u and an w-word v. the infinite word u.v is then the cu-word such that: 
{u.v){k) = u{k) a k < \u\ , and 
{u.v){k) = v(k — \u\) \i k > \u\. 

For 1/ C S*, V^ = {(7 = Ml . . . M„ . . . G S^/wi G V, Vi > 1} is the w-power of 

V. 

For V C S*, the complement of V (in S*) is S* — \/ denoted V~ . 

For a subset A C S'^, the complement oi A'lsiy — A denoted A~ . 

The prefix relation is denoted C: the finite word -u is a prefix of the finite 
word V (denoted u^v) ii and only if there exists a (finite) word w such that 
V = u.w. 

This definition is extended to finite words which are prefixes of w- words: 
the finite word m is a prefix of the w-word v (denoted -u C ti) iff there exists 
an w-word w such that v = u.w. 

Definition 2.1 : A finite state machine (FSM) is a quadruple M = {K, S, 6, Qq), 
where K is a finite set of states, H is a finite input alphabet, qo E K is the 
initial state and 6 is a mapping from K xH into 2^ . A FSM is called deter- 
ministic (DFSM) iff : 5:Kx^^ K. 

A Bilchi automaton (BA) is a 5-tuple M = {K, S, 6, go, F) where M' = 
{K, S, 6, go) is a finite state machine and F (1 K is the set of final states. 
A Muller automaton (MA) is a 5-tuple M = {K,J],6,qQ,J^) where M' = 
{K, S, 6, go) is a FSM and J^ C 2^ is the collection of designated state sets. 
A Bilchi or Muller automaton is said deterministic if the associated FSM is 
deterministic. 

Let a = aia2 . . .an . . . be an u-word over S. 

A sequence of states r = gig2 . . .qn . . . is called an (infinite) run of M = 
{K,Ti,6,qo) on a, starting in state p, iff: 1) qi = p and 2) for each i > 1, 
gi+i G 5{qi,ai). 

In case a run r of M on a starts in state go, we call it simply "a run of M on 
a " . 

For every (infinite) run r = gig2 . . .qn . . . of M, In{r) is the set of states in 
K entered by M infinitely many times during run r: 
In{r) = {g G K/{i > l/g^ = g} is infinite }. 

For M = (K, S, (5, go, -F) a BA , the u-language accepted by M is L{M) = 
{(7 G S'^ / there exists a run r of M on a such that In{r) fl F 7^ 0}. 



For M = {K,Il,S,qo,F) a MA, the u-language accepted by M is L{M) = 
{a G S'^ / there exists a run r of M on a such that In{r) G J-"}. 

The classical result of R. Mc Naughton [MaN66] established that the expressive 
power of deterministic MA (DMA) is equal to the expressive power of non 
deterministic MA (NDMA) which is also equal to the expressive power of non 
deterministic BA (NDBA) . 

There is also a characterization of the languages accepted by MA by means 
of the "cu-Kleene closure" which we give now the definition: 

Definition 2.2 For any family L of finitary languages over the alphabet E, 
the u-Kleene closure of L, is : 

oj-KC{L) = {U^^,U,.Vr/U„V, eL,\/te [l,n]} 

Theorem 2.3 For any u-language L, the following conditions are equiva- 
lent: 

(1) L belongs to u — KC{REG) , where REG is the class of (finitary) regular 
languages. 

(2) There exists a DMA that accepts L. 

(3) There exists a MA that accepts L. 

(4) There exists a BA that accepts L. 

An u-language L satisfying one of the conditions of the above Theorem is 

called an u -regular language. 

The class of u-regular languages will be denoted by REG^. 

We now define the pushdown machines and the classes of w-context free lan- 
guages. 

Definition 2.4 A pushdown machine (PDM) is a 6-tuple M = {K, S, F, 6, go, Zq), 
where K is a finite set of states, H is a finite input alphabet, T is a finite push- 
down alphabet, q^ E K is the initial state, Zq eT is the start symbol, and 5 is 
a mapping from i^ x (S U {A}) x F to finite subsets of K x T* . 
// 7 G F+ describes the pushdown store content, the leftmost symbol will be 
assumed to be on " top" of the store. A configuration of a PDM is a pair {q, 7) 
where q E K and 7 G F*. 

For a G S U {A}, 7, /3 G F* and Z eT, if {p, (5) is in 6{q, a, Z), then we write 
a : (g, Z7) H-»M (p,/37)- 

^^\j is the transitive and reflexive closure of^-^M- (The subscript M will be 
omitted whenever the meaning remains clear). 

Let a = aia2 . . .an ■ ■ ■ be an u-word over S. An infinite sequence of configura- 
tions r = {qi,'~fi)i>i is called a run of M on a, starting in configuration (p, 7), 



(1) (gi,7i) = (P,7) 

(2) for each i > 1, there exists 6j G S U {A} satisfying hi : {qi,'yi) ^m 
(O'i+i, 7i+i) such that either aia2 . . . a„ . . . = 6162 ■ ■ -bn . . . 

or 6162 ■ ■ -bn ■ ■ ■ is a finite prefix of aia2 . . . a„ . . . 

The run r is said to he complete when 0102 • • • a„ . . . = 6162 ■ ■ -bn ■ ■ ■ 

As for FSM, for every such run, In{r) is the set of all states entered infinitely 
often during run r. 

A complete run r of M on a , starting in configuration (go, Zq), will be simply 
called " a run of M on a ". 

Definition 2.5 A Bilchi pushdown automaton (BPDA) is a 7-tuple M = 
{K, S, r, 6, go, Zo, F) where M' = {K, S, T, 6, go, Zq) is a PDM and F C K is 
the set of final states. 

The u-language accepted by M is L{M) = {a E "E^ / there exists a complete 
run r of M on a such that In{r) fl F 7^ 0}. 

Definition 2.6 A Muller pushdown automaton (MPDA) is a 7-tuple M = 
{K, S, r, 6, go, Zo, T) where M' = {K, S, T, 6, go, Zq) is a PDM and J= (Z2^ is 
the collection of designated state sets. 

The u-language accepted by M is L{M) = {c G S"^ / there exists a complete 
run r of M on a such that In{r) G J-"}. 

Remark 2.7 We consider here two acceptance conditions for u -words , the 
Bilchi and the Muller acceptance conditions, respectively denoted 2-acceptance 
and 3-acceptance in [Lan69] and in [CG78] and {inf, Fl) and {inf, =) in 
[Sta97a]. 

R.S. Cohen and A.Y. Gold, and independently M. Linna, established a char- 
acterization Theorem for cj-CFL: 

Theorem 2.8 LetCFL be the class of context free (finitary) languages. Then 
for any u-language L the following three conditions are equivalent: 

(1) Leuj-KC{CFL). 

(2) There exists a BPDA that accepts L. 

(3) There exists a MPDA that accepts L. 

In [CG77] are also studied the w-languages generated by w-context free gram- 
mars and it is shown that each of the conditions 1), 2), and 3) of the above 
Theorem is also equivalent to: 4) L is generated by a context free grammar G 
by leftmost derivations. These grammars are also studied in [Niv77] , [Niv78]. 
Then we can let the following definition: 

Definition 2.9 An u-language is an u- context free language (u-CFL) iff it 



satisfies one of the conditions of the above Theorem. 



3 Topology 

We assume the reader to be familiar with basic notions of topology which may 
be found in [Mos80] [LT94] [Sta97a] [PP02] and with the elementary theory 
of (countable) ordinals. 

Topology is an important tool for the study of w-languages, and leads to 
characterization of several classes of cu-languages. 

For a finite alphabet X, we consider X^ as a topological space with the Cantor 
topology. The open sets of X'^ are the sets in the form W.X'^ , where W C X*. 
A set L C X'^ is a closed set iff its complement X'^ — L is an open set. The 
class of open sets of X^ will be denoted by G or by S^. The class of closed sets 
will be denoted by F or by n°. Closed sets are characterized by the following: 

Proposition 3.1 A set L C X"^ is a closed set of X'^ iff for every a G X"^, 

[Vn > 1, 3-u G X'^ such that a{l) . . . a{n).u G L] implies that a E L. 

Define now the next classes of the Borel Hierarchy: 

Definition 3.2 The classes S° and n° of the Borel Hierarchy on the topo- 
logical space X^ are defined as follows: 
S5 is the class of open sets of X^ . 
n° is the class of closed sets of X^ . 

n^ or G^ is the class of countable intersections of open sets of X^ . 
S2 or Fo- is the class of countable unions of closed sets of X^ . 
And for any integer n> 1: 

^n+i ^'5 ^^^ class of countable unions of 11^- sub sets of X^ . 
n°_,_j^ is the class of countable intersections ofE^-subsets of X^ . 
The Borel Hierarchy is also defined for transfinite levels. The classes S° and 
n°, for a countable ordinal a, are defined in the following way: 
S° is the class of countable unions of subsets of X'^ in U-y<Q,n°. 
n° is the class of countable intersections of subsets of X^ in U^<q,S°. 

Recall some basic results about these classes, [Mos80]: 

Proposition 3.3 

(a) S° U n° C S°_,_j^ n n°_|_j^, for each countable ordinal a > 1. 

(b) U-y<Q,S° = U^<Q,n° C S° n n°, for each countable limit ordinal a. 

(c) A set W C X'^ is in the class S° iff its complement is in the class n°. 

(d) S° — n° 7^ and n° — S° 7^ hold for every countable ordinal a > 1. 



(e) For every ordinal a > 1, the class S° is closed under countable unions 
and the class 11° is closed under countable intersections. 

We shall say that a subset of X'^ is a Borel set of rank a, for a countable 
ordinal a, iff it is in S° U 11° but not in U7<^(S° U 11°). 

There is a nice characterization of n°-subsets of X'^. First define the notion 

Definition 3.4 For W C X*, let: 

W^ = {ae X'^l^^i such that a[i] G W}. 

(a G W^ iff a has infinitely many prefixes in W). 

Then we can state the following Proposition: 

Proposition 3.5 (see [Sta97a]) A subset L of X"^ is a Ill-subset of X"^ iff 
there exists a set W C X* such that L = W^ . 

For X a finite set, (and this is also true if X is an infinite alphabet) there 
are some subsets of X'^ which are not Borel sets. Indeed there exists another 
hierarchy beyond the Borel hierarchy, which is called the projective hierarchy 
and which is obtained from the Borel hierarchy by successive applications of 
operations of projection and complementation. More precisely, a subset A of 
X'^ is in the class Sj; of analytic sets iff there exists another finite set Y and a 
Borel subset B oi {X x Y)'^ such that x e A^3y eY'^ such that {x, y) G B. 
We denote (x, y) the infinite word over the alphabet XxY such that (x, y){i) = 
{x{i),y{i)) for each integer i > 0. 

Now a subset of X'^ is in the class 11]; of coanalytic sets iff its complement 
in X'^ is an analytic set. 

The next classes are defined in the same manner, Sj^^j^-sets of X'^ are projec- 
tions of nj^-sets and nj^^_j^-sets are the complements of Sj^_,_j^-sets. 

Recall also the notion of completeness with regard to reduction by continuous 

functions. 

Let q; be a countable ordinal. A set F C X'^ is a S° (respectively n°)-complete 

set iff for any set E C y^ (with Y a finite alphabet): 

E G S° (respectively E G 11°) iff there exists a continuous function / : Y'^ — )■ 

X"^ such that E = f-\F). 

A similar notion exists for the classes of the projective hierarchy: in particular 

A set -F C X'^ is a S] (respectively nj;)-complete set iff for any set E C Y'^ 

{Y a finite alphabet): 

-E G S]; (respectively E G Ilj;) iff there exists a continuous function / such 

that E = /-1(F). 

A S° (respectively n°)-complete set is a S° (respectively 11°)- set which 
is in some sense a set of the highest topological complexity among the I]° 



(respectively H^)- sets. 

S^ (respectively n°)-complete sets, with n an integer > 1, are thoroughly 

characterized in [Sta86]. 

Landweber studied first the topological properties of w-regular languages. 
He proved that every cu-regular language is a boolean combination of Gs- 
sets. and he also characterized the w-regular languages in each of the Borel 
classes F, G, Fg-, G^, and showed that one can decide, for an effectively given 
cj-regular language L, whether L is in F, G, F^., or G^. 
It turned out that an cu-regular language is in the class G^ iff it is accepted 
by a deterministic Biichi automaton. 

When considering cu-CFL, natural questions now arise: are all w-CFL Borel 
sets of finite rank, Borel sets, analytic sets....? 

First recall the following previous result, [Sta97a]: 

Theorem 3.6 Every u-CFL over a finite alphabet X is an analytic subset of 

We showed the following 
Theorem 3.7 ([FinOOa]) 

(a) There exist some u-CFL which are T\-complete sets hence non Borel 
sets. 

(b) It is undecidable whether an effectively given u-CFL is a Borel set. 

Next the w-CFL exhaust the finite ranks of the Borel hierarchy. 

Theorem 3.8 ([FinOla]) For each non negative integer n > 1, there exist 
H^-complete u-CFL An and Yl^-complete u-CFL En. 

Cohen and Gold proved that one cannot decide whether an cj-CFL is in the 
class F, G or G^. We have extended in [FinOla] this result to all classes I]° 
and n°, for n an integer > 1, and next to all Borel classes in [FinOOa]. (We 
say that an w-CFL A is effectively given when a MPDA accepting A is given). 

But the question was still open whether there exist some omega context free 
languages which are Borel sets of infinite (but not finite) rank. We shall show 
below that there exist such omega context free languages. 



4 Operation " exponentiation of sets " 



In order to construct omega context free languages of every finite rank, we used 
recent results of J. Duparc about the Wadge hierarchy. The Wadge hierarchy 
of Borel sets is a huge refinement of the Borel hierarchy. Wadge gave first a 
description of this hierarchy [Wad84] and Duparc recently got a new proof 
of Wadge's results and he gave a normal form of Borel sets, i.e. an inductive 
construction of a Borel set of every given degree [Dup95a] [DupOl]. In fact we 
shall need in this paper only some of his results. So we shall recall only these 
results and refer to [Dup95a] [DupOl] for more details. 

Duparc's proof relies on set theoretic operations which are the counterpart of 
arithmetical operations over ordinals needed to compute the Wadge degrees. 
In fact we shall only use in this paper the operation of exponentiation over sets 
of infinite words. Moreover we shall consider a slight modification of Duparc's 
operation A — )■ A^ we introduced in [FinOla] and which we recall now: 

Definition 4.1 Let X^ be a finite alphabet and <^ ^ X^. 

Let X = Xa U {<^} and x be a finite or infinite word over the alphabet X . 

Then x*^ is inductively defined by: 

X'^ = X, 

For a finite word u G {Xa U {<^})*-' 

iu.a)*^ = u*^.a, if a G Xa, 

{u. «-)*^ = u*^ with its last letter removed if {u'^l > 0, 

{u. -w-)*^ is undefined if \u*^\ = 0, 

and for u infinite: 

{u)'^ = \im.neuj{u[n])'^ , where, given f3n and v in X^, 

V C lim„e^ f3n ^ 3nyp > n I3p[\v\] = v. 

Remark 4.2 For x G X-'^ , x*^ denotes the string x, once every *^ occuring 
in X has been "evaluated" to the back space operation ( the one familiar to 
your computer!), proceeding from left to right inside x. In other words x*^ = x 
from which every interval of the form "a <^ " (a E Xa) is removed. We add 
the convention that {u. <^)*^ is undefined if {u'^l = 0, i.e. when the last letter 
<^ can not be used as an eraser (because every letter of Xa in u has already 
been erased by some erasers *^ placed in u). Remark that the resulting word 
x*^ may be finite or infinite. 

For example if -u = (a «-)", for n > 1, -u = (a <^)'^ or -u = (a <^<^)'^ then 

{ur = A, 

if M = {ah <^)'^ then (m)*" = a"^, 
if -u = 66(<^ a)'^ then (u)*^ = b, 
if u =<^ (a <^)'^ or -u = a <^«- a'^ then (m)*^ is undefined. 

We can now define the variant A — )■ A~ of the operation of exponentiation of 
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sets: 

Definition 4.3 For A C X'X and ^ ^ Xa, let X = XaU {^} and 

A"" = {xe {Xa U {^})'^/x^ G A}. 

The following result is then another formulation of a property of the operation 
A — )■ A~ proved in [DupOl] and which was applied in [FinOla] to study the 
cu-powers of finitary context free languages. 

Theorem 4.4 Let n be an integer > 2 and A C X^ be a Il^-complete set. 
Then A^ is a II^_^_i- complete subset of {Xa U {<^})'^. 

We proved that the class CFL^ is closed under this operation A — )■ A~. 

Theorem 4.5 ([FinOla]) Whenever A C X^ is an u-CFL, then A^ C 
{Xa U {'^}Y is an u-CFL. 

Proof. An w-word a G A~ may be considered as an w-word a*^ G A to which 

we possibly add, before the first letter cr*^(l) of a'^ (respectively between two 

consecutive letters o"*^(n) and a'^{n+ 1) of o""^), a finite word Vi (respectively 

Vn+i ) where: 

for all integers i > 1, Vi belongs to the context free (finitary) language L3 

generated by the context free grammar with the following production rules: 

S* — 7> aS* <^ S* with a G Xa, 

5* — )■ A (A being the empty word). 

this language L3 corresponds to words where every letter of Xa has been 

removed after using the back space operation. 

Remark 4.6 Recall that a one counter automaton is a pushdown automaton 
with a pushdown alphabet in the form T = {Zq,z} where Zq is the bottom 
symbol and always remains at the bottom of the pushdown store. And a one 
counter language is a (finitary) language which is accepted by a one counter 
automaton by final states. It is easy to see that in fact L3 is a deterministic 
one-counter language, i.e. L3 is accepted by a deterministic one-counter au- 
tomaton. And for a G Xa, the language L^.a is also accepted by a deterministic 
one-counter automaton. 

Then we can see that whenever A C X^, the cu-language A~ C (X^U{<^})'^ 
is obtained by substituting in A the language L^.a for each letter a G Xa, 
where L3 is the CFL defined above. 

Let now A be an u-CFL given hj A = IJ"^-,^ Ui.V^ where f/j and Vi are con- 
text free languages. Then A~ = ljr=i(f^i)-^j'^) where U[ (respectively ¥[) is 
obtained by substituting the language L^.a to each letter a G Xa in f/j (re- 
spectively Vi). 
The class CFL is closed under substitution, so U[ and ¥( are CFL. Hence the 
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w-language A~ is an w-CFL because u — KC{CFL) C CFL^^. 

We have also given in [FinOla] an effective construction of a MPDA accept- 
ing the w-language A~ C [Xa U {<^})'^ from a MPDA accepting an u- 
language A C X^. Recall now the idea of this construction. 

Let A be an cu-CFL which is accepted by a Muller pushdown automaton 
A = {K, Xa, T,S,qo, Zq,J^). The cu-language accepted by A is L{A) = A = 
{a G X^ / there exists a run r of ^ on o" such that In{r) G J-"}. 

We can construct another MPDA A^ which accepts the w-language A'^ over 
the alphabet X = Xa U {«-}. 

Let us describe informally the behaviour of the machine A^ when it reads an 
w-word a G A~. Recall that this word may be considered as an w-word a*^ G A 
to which we possibly add, before the first letter cr'^{l) of a*^ (respectively 
between two consecutive letters cr'^(n) and o"*^(n + 1) of c*^), a finite word vi 
(respectively f„+i ) where Vi belongs to the context free language L3. 

A^ starts the reading as a pushdown automaton accepting the language L3. 
Then A^ begins to read as A, but at any moment of the computation it may 
guess (using the non determinism) that it reads a finite segment v of L3 which 
will be erased (using the eraser <^). It reads v using an additional stack letter 
E which permits to simulate a one counter automaton at the top of the stack 
while keeping the memory of the stack of A. Then, after the reading of v, A^ 
simulates again the machine A and so on. 



u-CFL which are Borel of infinite rank 



A well known example of n2-complete w-regular language is 

B2 = {ae {0, l}'^/3^i a{i) = 1} = (OM)'^, 

where 3'^i means: " there exist infinitely many i such that . . ." . 

B2 is an omega context free language because it is an w-regular language. 

We can now get some n°_j,2"Complete set, for an integer n > 1, from the II"- 
complete set B2 by applying n > 1 times the operation of exponentiation of 

sets. 

More precisely, we define, for a set A C X^: 

A~-i = A« and 

where we apply k + 1 times the operation A — )> A~ with different new letters 

^1, <^2, <^3, •••, ^fc+1- 
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We can now infer from Theorems 4.4 and 4.5 that, for an integer n > 1, 
(i?2)~'" is an omega context free language which is a n°^2"Complete subset of 
{0, 1, «-i, . . . , «-n}'^. Similarly, if A C X^ is a Il^-complete regular or context 
free cj- language over the alphabet Xa, the w-language (A)~-" is a n°_,_2- 
complete subset of {Xa U {<^i, . . . , <^n})'^- 

A way to obtain a Borel set of infinite rank, as we shall show below, is to 
define, for two letters a, b in Xa, the supremum of the sets A~'*: 



supA^-' = [ja\b.A-' 

But this set is defined over an infinite alphabet, and any omega context free 
cu-language is defined over a finite alphabet. So we have first to code this set 
over a finite alphabet. We shall first code every set A~'". The w-language A~"^ 
is defined over the alphabet Xa U {<^i, . . . , <^n} hence we have to code every 
eraser <^j by a finite word over a fixed finite alphabet. We shall code the eraser 
■«-j by the finite word a.BKCKDKEK(5 over the alphabet {a,B,C,D,E, /3}. 
The reason of the coding we choose will be clear later, when we construct 
a Muller pushdown automaton accepting an w-language close to the cod- 
ing of supjgpj A~'*. In fact this MPDA needs to read four times the integer j 
characterizing the eraser «-j. 

Remark first that one can define the morphism 

F„ : (Xa U {^1, . . . , ^n}y -^ {Xa U {a, /3, B, C, D, E}y 

by F{c) = c for each c G Xa and F{*^j) = a.BKCKDKEK(5 for each integer 
j G [1, "^J, where -B, C, D, E, a, /3 are new letters not in Xa- This morphism is 
naturally extended to a continuous function 

Fn : {Xa U {^i, . . . , ^4)" ^ {Xa U {a, /3, B, C, D, Ej^ 

Then F„((XaU{<^i, . . . , <^n})'^) is the continuous image by F„ of the compact 
set {Xa U {<^i, . . . , <^n})'^, hence it is also a compact set, and a closed subset 
of {Xa U {a, (3, B, C, D, E}y. We can now state the following lemma. Its proof 
is easy and left to the reader. 

Lemma 5.1 Let A C X^ be a Yl^-complete subset of X^. Then for each 
integer n>l, the u-language F„(A~'") is a n°_,_2-comj>/ete subset of {Xa U 
{a,/3,B,C,D,E}r. 

We shall prove now that the supremum of the sets -F„(A~") is a Borel set of 
infinite rank. 
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Lemma 5.2 Let A C X^ be a Il2-complete subset of X'^. Then the set 



n>l 



n>l 



zs a Ti^-subset of {Xa U {a, /3, B, C, D, E})'^ which is not a Borel set of finite 
rank. 

Proof. Assume A C X^ is n2-complete. Then the preceding lemma imphes 
that, for each n > 1, the cj-language F„(yl~-") is a n°_,_2-complete subset of 
{Xa U {a, /3, -B, C, D, -E})'^. Let a, 6 be two letters in X^ then it is easy to show 
that, for each n > 1, the set a".6.-F„(A~'") is also a n°_|_2-complete subset of 
{Xa U {a, /3, 5, C, D, E})^ thus 



supF„(A"-")= Ua".6.F„(A= 



?i>i 



n>l 



is in the class S° by definition of this class. 

On the other side this set cannot be a Borel set of finite rank. Because if 
sup„>i -F„(A~") was in the class LI?, for an integer j > 1, then the set 

a".6.F„(A^-") = sup F„(A^-") n (a".6.(XA U {a, /3, 5, C, D, ^})^) 

n>l 

would be also in the class L!?, because (a".6.(X^ U {a, (5,B,C,D,E}Y) is a 
closed hence LE^-set and the class of n?-subsets of (XaU{q;, /3, -B, C, D, -E})'^ is 
closed under finite intersection. But this would lead to a contradiction because 
we have seen that, for n > j, the set a".6.F„(A~'") is n°^2"Complete, where 
n + 2>j + 2> j hence it is not a n^-subset of {Xa U {a, f3, B, C, D, E})'^. D 

We can not show that the cu-language sup„>i F„(A~-") is an omega context 
free language. This is connected to the fact that the finitary language 

{B^CW^E^ / J > 1} 

is not a context free language. But its complement is easily seen to be con- 
text free. Then, instead of considering sup„>i F„(A~"), we can add to this 
cj-language all w-words in the form aJ^.h.u where there is in -u a segment 
a.BKC^ .DKE^-.P, with j, /c, /, m integers > 1, which does not code any eraser, 
or codes an eraser <^j for j > n. Then we add to sup„>i -F„(A~'") another oo- 
language which is of Borel rank 2 and the resulting w-language will be still 
of infinite rank, but we shall show that it is an omega context free language. 

So we define now formally this construction in the following way. 

Define first the following context free finitary languages over the alphabet 
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L^ = {a^.h.u.B^ / n > 1 and j > n and M G (X°)*} 

L^ = {a^.h.u.C^ / n > 1 and j > n and M G {X^Y} 

L^ = {a'^.b.u.D^ / n > 1 and j > n and M G (X°)*} 

L^ = {a^.h.u.E^ / n > 1 and j > n and M G (X°)*} 

L(^'^) = {u.a.B^.C^.DKE'^.13 / 3,k,l,m > 1 and j y^ k and m G (X°)*} 

l{c,D) ^ {u.a.B^.C''.D^.E"'.p / j, A;, /, m > 1 and A; ^ / and m G {X^Y} 

L^D'^) = {u.a.B^.C''.D^.E"'.p / j, A;, /, m > 1 and / ^ m and m G {X^Y} 

Let now 

L = L^ U L^ U L^ U L^ U L^^'^) U L^^'^) U L^^'^) 

It is easy to show that each of the languages L^, L^ , L^, L^, L'-^'^\ L^^^^\ L^^'^) 
is a context free finitary language thus L is also context free because the class 
CFL is closed under finite union. Then the w-language L.{X^Y is an w-CFL 
which is an open subset of [X^Y . 

Remark now that any word in sup„>]^ -F„(A~'"') belongs to the regular u- 
language 

R = a+.b.{XA U (a.5+.C+.D+.E+./3))^ 

because every word has an initial segment in the form a". 6 with n > 1 and 
the letters a, B, C, D, E, /3 are only used to code the erasers «-j for j > 1. 

Consider now the w-language 

L.{x^Y^R 



An w-word a in this language is a word in R such that a has an initial word 
in the form a". 6, with n > 1, and a contains a segment a.BKC^ .d} .E^-.p with 
j, k,l,m > 1 which does not code any eraser «-j or codes such an eraser but 
with j > n. Thus this w-language is disjoint from the set sup„>]^ F„(A~-"). 
Consider now the w-language: 



A' = supF„(A^-") U [L.{X"Y n R] 

n>l 

We can now state the next lemma. 

Lemma 5.3 Let A C X% be a Ii%-complete subset of X%. Then A' is a S°- 
subset of {Xa U {q;, /3, 5, C, D, E}Y which is not a Borel set of finite rank. 
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Proof. Let A C X^ be a Il^-complete subset of X^. Then we have aheady 
seen that sup„>i F„(A~-") is a S°-subset of (X^ U {a, /3, 5, C, D, E})'^. On the 
other side it is easy to see, from proposition 3.5, that the w-regular language R 
is a Ilj-set because 

R = {R'Y 

where R' is the finitary (regular) language defined by 

R' = a+.b.{XA U {a.B+.C+.D+.E+.f3))+ 

Then the w-language 

L.(x°)'^ni? 

is the intersection of an open set and of a n2-set. Thus it is also a n2-set 
because the class 112 i^ closed under finite intersection. Then the w-language 

A* = sup F„(A^-") U [L.(X°)^ n R] 

n>l 

is a S°-subset of {Xa U {a, /3,B,C,D,E})'^ because the class S° is closed 
under finite union. 

We want now to prove that A* is not a Borel set of finite rank. Assume, on 
the contrary, that A* is of finite rank J, where J is an integer > 1. Then 
the intersection of A* and of the complement of L.(X°)'^ fl R would be the 
intersection of a Il^+i-set and of a S" hence Ilg-set. Hence 



supF„(A^-") = A'n (L.(x°)'^ nR)- 

n>l 

would be a n°-set, with k = max{3, J+1). But this is not possible because we 
know from the preceding lemma that sup„>i F„(A~") is a Borel set of infinite 
rank. D 

We can now state the following 

Theorem 5.4 Let A C X^ be an u-regular language over the alphabet Xa- 
Then the u-language 

A' = supF„(A^-") U [L.(X°)^ n R] 

n>l 

is an u-CFL over the alphabet {Xa U {a, f3, B, C, D, E}). 
Proof. We have already seen that L.(X°)'^ is an w-CFL, thus 

L.(x°)'^ni? 
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is also an u-CFL because the class of omega context free languages is closed 
under intersection with w- regular languages, [CG77]. 

Suppose the w-regular language A C X^ is accepted by the deterministic 
Muller automaton A = {K, Xa, S, go, J^a) where A' = {K, X^, S, go) is a FSM 
and J-'_4 C 2^ is the collection of designated state sets. 

We shall find a MPDA B accepting an cu-CFL L{B) such that 

supF„(A^-") C L{B) C A* = supF„(A^-") U [L.(X°)^ n R] 

n>l ji>l 



Thus we shall have 



A* = L{B)U[L.{X°)'^nR] 

And this will imply that A* is an u-CFL because the class CFL^ is closed 
under finite union [CG77]. 

It is easy to have L{B) C R because if L{B') is an u-CFL which is not included 
into R one can replace it by L{B) = L{B')r\R which is then an u-CFL verifying 
L{B) = L{B')nRCR. 

Recall now that 

is the set of all w- words in R having an initial segment in the form a". 6, with 
n > 1, and containing a segment a.BKC^ .d} .E^.(5 with j,k,l,m > 1 which 
does not code any eraser «-j or codes such an eraser but with j > n. 

Thus, in order to define the MPDA B, we have only to consider the behaviour 
of B when reading w-words in the form 

a'^.b.u 

where n > 1 and m G {Xa U {a.B+.C+.D+.E+./3))'^ is such that the letters 
a, B, C, D, E, P in u are only used to code the erasers <^j for 1 < j < n. 
( In order to simplify our notations, we shall sometimes write in the sequel 
<^j= a.BKCKDKEK(5 and call eraser either -m-j or its code a.BKCKDKEKp, 
with j > 1 ). 

And we have to find a MPDA B such that L{B) contains such a word aJ^.h.u 
if and only if -u G F„(A~-"). 



"), 



So we have to look first at w- words in F„(A~'"). In such a word a G Fn{A 
there are (codes of) erasers «-i, . . . , «-„. The cj-word a is in F„(A~") if and 
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only if after the operations of erasing ( with the erasers «-i, ..., «-„ ) have been 
achieved in a, then the resulting word is in A. 

Because of the inductive definition of the sets A^", the operations of erasing 
have to be done in a good order: in an cj-word which contains only the erasers 
«^i, ...,«-„, the first operation of erasing uses the last eraser «-„, then the 
second one uses the eraser <^n-i, and so on . . . 

Therefore these operations satisfy the following properties: 

(a) An eraser «-j may only erase letters c G Xa or other erasers «-fc with 
k<j. 

(b) Assume that in a word u G -F„(A~'"'), there is a segment c.v.x where c 
is either in Xa or in the set {«-i, . . . , «-n-i}, and x is (the code of) an 
eraser <^k which erases c when the operations of erasing are successively 
achieved. Now if there is in the segment v (the code of) an eraser «-j 
which erases e, where e G Xa or e is (the code of) another eraser, then 
e must belong to v (it is between c and x in the word u); moreover the 
operation of erasing using the eraser ^^j has been achieved before that 
one using the eraser -w-fc and this implies that k < j. Thus the integer k 
must verify: 

k < min[ j / an eraser <^j has been used into v] 

We can now informally describe the behaviour of the MPDA B when reading 
a word a^.b.u such that the letters a, B, C, D, E, (3 are only used in u to code 
the erasers <^j for 1 < j < n. 

After the reading of the initial segment in the form a^.h, the MPDA B simu- 
lates the Muller automaton A until it guesses, using the non determinism, that 
it begins to read a segment w which contains erasers which really erase and 
some letters of Xa or some other erasers which are erased when the operations 
of erasing are achieved in u. 

Then, using the non determinism, when B reads a letter c G Xa it may guess 
that this letter will be erased and push it in the pushdown store, keeping in 
memory the current state of the Muller automaton A. 

In a similar manner when B reads the code <^j= a.BKCKDKEK[5 of an eraser, 
it may guess that this eraser will be erased (by another eraser -m-^ with k > j) 
and then it pushes in the store the finite word 'y.E^ .e, where 7, E, e are in 
the pushdown alphabet. 

But B may also guess that the eraser <^j= a.BKC^ .DKEKP will really be 
used as an eraser. If it guesses that the code of <^j will be used as an eraser, 
B has to pop from the top of the pushdown store either a letter of c G Xa or 



the code 'j.E^.e of another eraser <^j, with i < j, which is erased by <^ 
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It would be easy for B to check whether i < j when reading the initial segment 
a.B^ of *^j. 

But as we remarked in item (b) above, the MPDA B has also to check that 
the integer j is smaller or equal than every integer p such that an eraser -m-^ 
has been used since the letter c G Xa or the code '-/.E^.e was pushed in the 
store. Then, after having pushed in the pushdown store some letter x G Xa 
or the code x = 'j.E^.e of an eraser, and before it pops it from the top of the 
store, B has to keep in the memory of the stack the integer 



k = min[p / some eraser «-p has been used since x was pushed in the stack ] 

For that purpose B pushes the finite word L2.S^.Li in the pushdown store 
(Li is pushed first, then S^ and the letter L2 are pushed in the stack), where 
Li, L2 and S are new letters added to the pushdown alphabet. 

So, when B guesses that *^j= a.BKCKDKEK[5 will be really used as an eraser, 
there is at the top of the stack either a letter c G Xa or a code 'j.E^.e of an 
eraser which will be erased or a code L2.S^ .L\. The behaviour of B is then as 
follows. 

Assume first there is at the top of the stack a code L2.S^.Li. Then B firstly 
checks that j < k when reading the segment a.BKC of the eraser a.B^ .CKD^ .EK[i. 
li j < k holds, then B pops completely, using A-transitions, the word L2.S^ .Li 
from the top of the stack. {B has checked it is allowed to use the eraser <^j). 

Then there is now in every case at the top of the stack either a letter c G Xa or 

a code '-/.E^.e of an eraser which will be erased. The MPDA B pops this letter 

c or the code '-/.E^.e ( having checked that j > i after reading the segment 

a.BK& of the eraser a.BK& .DKEK(5 ). 

We have to consider what is now at the top of the stack and distinguish three 

cases: 

(1) If there is now at the top of the stack the bottom symbol Zq, then the 
MPDA B, after having completely read the eraser «-j, may pursue the 
simulation of the Muller automaton A or guesses that it begins to read 
another segment v which will be erased, hence the next letter c G Xa or 
the next code a.B^ .C^ .D"^ .E™- .(5 of the word will be erased and then 
B pushes the letter c G Xa or the code 'y.E"^.e of «-„ in the pushdown 
store. 

(2) If there is now at the top of the stack either a letter c' G Xa or a code 
j.E^^.e, then B pushes the code L2.SKL1 in the pushdown store ( j is 
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then the minimum of the set of integers p such that an eraser <^p has 
been used since the letter c' or the code '-f.E"^.e has been pushed into the 
stack). 
(3) If there is now at the top of the stack a code L2.SKL1, then the MPDA B 
has to compare the integers j and / and to replace L2.SKLi by L2.S^ .Li 
if j < I. B achieves this task while reading the segment DKEK(5 of the 
eraser a.BKCKDKEKp. 

The MPDA B pops a letter S for each letter D read. It then determines 
whether j < L 

li j < I then B pushes L2.SKL1 when reading the segment EKI5 of the 
eraser -m-j. 

If j > I, then when every letter S of the code L2.S''.Li has been popped, 
there are {j — I) letters D of the eraser a.BKCKD^ .EK(5 which have not 
yet been read by B. When reading these letters the MPDA B pushes {j—l) 
letters U in the stack, where f/ is a new letter in the pushdown alphabet. 
Then, when reading the segment E^ of the eraser a.BKC^ .DKE^P, the 
MPDA B firstly pops f/-'"' (when reading the first (j — /) letters E); 
Afterwards B pushes again S*' in the stack when reading the rest of the 
eraser a.B^ .CKDKEKp. 

When the content of the stack is again just Zq, the initial stack symbol of 
the MPDA B, then B may pursue the simulation of the Muller automaton 
A or guesses it begin to read a new segment which will be erased when the 
operations of erasing will be successively achieved. D 

We can now state our main result: 

Theorem 5.5 Let A C X^ he a Yl^-complete u-regular language over the 
alphabet X A. Then A' is an omega context free language which is a H^-suhset 
of {Xa U {a, /3, -B, C, D, E})'^ hut is not a Borel set of finite rank. 

Proof. It follows directly from Lemma 5.3 and Theorem 5.4 D 

In particular if B2 = {a e {0, lY /3^i a{i) = 1} = (OM)'^, then {B2)' is an 
omega context free language which is a Borel set of infinite rank. 

Theorem 5.5 provides infinitely many such u-CFL over any finite alphabet X 
of cardinal > 2, because there exist infinitely many n2-complete w-regular 
languages over the alphabet X, and for such an tu- regular language A it holds 
that 



A'nX^ = a+.6.A 
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6 Concluding remarks and further work 



We knew that the class of omega context free languages exhausts the finite 
ranks of the Borel hierarchy and that there exist some u-CFL which are ana- 
lytic but non Borel sets. We have proven above that there exist some omega 
context free languages which are Borel sets of infinite rank. 

It is well known that Turing machines, with a Biichi or Muller acceptance 
condition, accept w-languages of every Borel rank < uf^ , where u^^ is the 
first non recursive ordinal [Sta97a] [Mos80] [Sim92] . Then the following problem 
naturally arises: describe the set of infinite Borel ranks of omega context free 
languages, and in particular find the ordinal 



sup{a / there exist some Borel u — CFL of rank a} 

which is of course < u^^ . Unfortunately we cannot reach some Borel ranks 
> cu by iterating our operation A ^ A* . In fact one cannot even reach some 
I]°-complete set, as it will be explained in [FinOlc] by considering the Wadge 
degrees of Borel sets. 

Recall that the Wadge hierarchy of Borel sets is a great refinement of the Borel 
hierarchy. We proved in [FinOlb] that the length of the Wadge hierarchy of 
Borel cu-CFL is an ordinal greater or equal to the Cantor ordinal Eq, which is 
the first fixed point of the ordinal exponentiation of base oo. Using the above 
construction of A* , we have improved this result, showing that this length is 
an ordinal greater than or equal to e^^, which is the u^^ fixed point of the 
ordinal exponentiation of base w, [FinOlc]. 



Acknowledgements. Thanks to the anonymous referees for useful comments 
on a previous version of this paper. 
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